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Reviews 


The novel glycoproteins of 
coronaviruses 


Lawrence Sturman and Kathryn Holmes 


The two glycoproteins of coronavirus envelopes have different functions in intracellu- 
lar budding, cytopathic effects and virus infectivity, and have markedly different 
biochemical properties. 


Glycoproteins of viral envelopes medi- 
ate many of the biological activities of 
enveloped virions, such as attachment of 
virus peplomers (spikes) to receptors on 
the cell surface, entry of virions into cells 
by fusion of the viral envelope with cel- 
lular membranes, virus-induced cell 
fusion, and cellular and humoral 
responses to viral infection. In this brief 


review we will summarize recent studies . 


on the envelope glycoproteins of coro- 
naviruses, emphasizing the biochemical 
characteristics associated with their dif- 
ferent roles in intracellular budding of 
virions, virus-induced cell fusion and 
immune responses to coronavirus 
infections. 

A model for the structure of the 
coronavirus virion is shown in Fig. 1. 
Coronaviruses are large, enveloped vir- 
uses with helical nucleocapsids that con- 
tain RNA genomes of positive or 
message-sense polarity'. The viral envel- 
ope is a lipid bilayer with two viral- 
encoded glycoproteins which we have 
called El and E2. The glycoprotein 
which forms the viral peplomers or 
spikes, E2, is similar to the spike glyco- 
proteins of other enveloped RNA vir- 
uses. In contrast, the membrane- 
associated glycoprotein, El, which 
appears to function like the nonglycosyl- 
ated M proteins or orthomyxo-, para- 
myxo-, and rhabdoviruses, differs 
markedly from other viral glycoproteins 
in its structure, processing, and intra- 
cellular transport. The major differences 
between the Ei and E2 glycoproteins of 
the A59 strain of murine hepatitis virus 
(MHV-AS9) are summarized in Table I 
and will be discussed in detail after a 
summary of the virus replication 
strategy. 
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Upon entry into the cell, the positive- 
strand RNA genome (M, 5.4 x 10°) 
apparently acts as messenger RNA 
(mRNA) for the synthesis of RNA- 
dependent RNA _ polymerase’. This 
enzyme transcribes the genome, forming 
a genome-sized, negative-strand tem- 
plate’. From this template, new genomic 
RNA, and six subgenomic mRNAs are 
transcribed’°, These capped and poly- 
adenylated RNAs all share a common 3’ 
end and extend for different lengths in 
the 5’ direction, forming a set of seven 
overlapping transcripts’. Recent studies 
by Lai e¢ al.!°!! and Spaan et al.'2 show 
that a leader RNA at least 70 
nucleotides in length, encoded by the 3’ 
end of the negative-strand template, is 
added to the 5’ end of each mRNA and 
genomic RNA. Although it is not yet 
certain how the leader RNA is added to 
the noncontiguous sequences of the 
mRNAs, the available evidence suggests 
that the leader is synthesized separately 
and utilized as a primer for transcription 
(Ref. 13 and M. Lai, personal communi- 
cation). Jn vitro translation of the iso- 
lated viral mRNAs suggests that only 
the gene at the 5’ end of each MRNA 
species is translated, yielding a single 
polypeptide'+"®, 


E2, the spike glycoprotein 

Translation in vitro indicates that syn- 
thesis of E2 is directed by mRNA 3 (M, 
2.6 X 10%). E2 is synthesized on ribo- 
somes bound to the rough endoplasmic 
reticulum (RER) and is co-transla- 
tionally glycosylated by transfer of 
N-linked oligosaccharides from dolichol 
phosphate carriers to asparagine resi- 
dues*>. The apoprotein appears to be 
about M, 0.12 x 10° as shown by experi- 
ments in which attachment of the 
oligosaccharides was prevented by 
tunicamycin, and from translation in 
vitro in the absence of microsomal mem- 
branes'*!7.'8_ After glycosylation, E2 has 
an apparent M. of 0.18 x 10° (180 K). 


Palmitic acid is covalently bound to E2, 
and this acylation probably occurs as the 
glycoprotein is transported through the 
Golgi apparatus. 

Virions form by budding from RER 
and Golgi membranes and are released 
from the cell apparently via the cellular 
secretory apparatus. Because viral com- 
ponents assemble at RER and Golgi 
membranes, subsequent processing of 
E2 may occur as intact virions are trans- 
ported through intracellular compart- 
ments. Pulse-labeling shows that most of 
the E2 is incorporated into virions and 
released from the cell within 2 hours of 
synthesis”. A small proportion of E2 
which is not incorporated into virions is 
transported to the plasma membrane 
where it may participate in cell fusion 
and render cells susceptible to cell-medi- 
ated cytotoxicity. This route for intra-, 
cellular transport of E2 to the plasma 
membrane is the same as that of the 
envelope glycoproteins of orthomyxo-, 
paramyxo-, rhabdo- and alphaviruses. 

Proteolytic cleavage of 180K E2 by 
host-cell proteases to form products of 
M,0.09 x 10° apparently occurs as a late 
step in transport of virions and E2 to the 
plasma membrane. Two such 90K spe- 
cies, which we have called 90A and 90B, 
can be separated by hydroxyapatite 
chromatography”. These cleavage prod- 
ucts have different amino acid composi- 
tions, and only the 90A form is acylated. 
On virions released from different cell 
types, the ratio of 180K E2 to 90K E2 
varies considerably, presumably reflect- 
ing differences in host-protease activity. 
Cell fusion occurs in coronavirus 
infections in vivo and in vitro and is 
mediated by E2?. Host-dependent cleav- 
age of E2 may be an important determi- 
nant of coronavirus virulence. 

Proteolytic cleavage of E2 to 90K 
forms is required for rapid cell fusion by 
exogenous virus”. Treatment of virions 
with trypsin, elastase, thermolysin or 
chymotrypsin activates cell-fusing ac- 
tivity so that cells can be fused in one 
hour in the absence of protein synthesis 
(F. Baker, C. Ricard and L. Sturman, 
unpublished results). As sequence data 
become available from molecular clon- 
ing and amino acid sequencing, it will be 
of considerable interest to compare the 


‘amino acid sequences near the protease 


sites on E2 with sequences of fusion 
glycoproteins of other viruses such as 
the F glycoprotein of paramyxoviruses 
and the HA glycoprotein of influenza 
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HELICAL 
NUCLEOCAPSID 


Fig. 1. Model of a coronavirus. The viral nucleocapsid is a long, flexible helix composed of the (+)-strand 
genomic RNA (M, 5-7 x 105) and many molecules of the phosphorylated nucleocapsid protein, N (M, 
0.05-0.06 x 10°). The viral envelope includes a lipid bilayer derived from intracellular membranes of the 
host cell and two viral glycoproteins, El (M, 0.02-0.03 x 10°) and E2 (M, 0.18-0.20 x 10°). The 
peplomers are composed of E2. The membrane glycoprotein El penetrates through the lipid bilayer and 


interacts with the nucleocapsid within the virion. 


virus. In these cases, cleavage yields a 
new amino terminus with a markedly 
hydrophobic sequence of amino acids, 
which may participate in the cell-fusion 
reaction. 

The E2 glycoprotein also appears to 
be responsible for the unusual 
pH-dependent thermolability of corona- 
viruses. There is a rapid loss of corona- 
virus infectivity above pH 6.5 associated 
with aggregation of the peplomeric 
glycoprotein. This suggests that there 
may be a pH-dependent conformational 
change in the E2 glycoprotein”. The 
amino acid sequence of E2 will soon be 
known from cloning and sequencing 
studies in progress in several labora- 
tories. Comparison of this glycoprotein 
from a positive-stranded coronavirus 
with those of the large negative-stranded 
orthomyxo-, paramyxo- and rhabdo- 
viruses may reveal important structural 
and functional determinants of envelope 
glycoproteins, and may clarify the evolu- 
tionary relationship of coronaviruses 
with these other virus groups. 


E1, the membrane glycoprotein 

The molecular characteristics of El 
differ from E2 and other viral envelope 
glycoproteins in many ways (Table I). 
Messenger RNA 6 (M_0.9 x 10°) which 
encodes E1 can be translated in vitro in 
the absence of membranes. When 


microsomal membranes were added 
after protein synthesis had begun, 
Rottier et al. found that about 65-70% 
of the El molecule was synthesized 
before the protein was inserted into the 
lipid bilayer”, suggesting that an inter- 
nal signal sequence was used. Although 
El lacks an amino terminal signal 
sequence, the cellular signal recognition 
particle (SRP) is required for integration 
into microsomal membranes, since addi- 
tion of SRP to a cell-free translation sys- 
tem blocks elongation of E1 (P. Rottier, 
J. Armstrong, and D. Meyer, in press). 
The block is released by addition of 
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microsomes, concomitant with assembly 
of the protein. 

The primary structure of El deduced 
from analysis of cloned cDNA derived 
from viral mRNA reveals that there are 
two long sequences of uncharged amino 
acids which represent potential mem- 
brane-spanning regions of the protein”. 
Investigation of the disposition of E1 in 
microsomal membranes by digestion 
with protease indicates that only small 
portions at the N- and C-termini (plus a 
portion near the center of the molecule 
of M, 500) are exposed or accessible to 
protease in the lumenal and cytoplasmic 
domains respectively”. 

Glycosylation of MHV El! differs 
from that of most other viral glyco- 
proteins in several ways’. First, it is 
glycosylated post-translationally rather 
than co-translationally like N-linked 
glycoproteins. Pulse-labeling studies of 
infected cells show that E1 is synthesized 
as a 20K apoprotein in association with 
the RER, and apparently migrates to 
the Golgi apparatus where glycosylation 
occurs after about 20-30 minutes. The 
oligosaccharides attached to E1 are dif- 
ferent from those on E2 and most other 
viral glycoproteins. Glycosylation of E1 
is not inhibited by tunicamycin, and the 
oligosaccharides are O-linked to serine 
and threonine residues at the amino ter- 
minus which projects from the external 
side of the viral envelope. Niemann and 
his colleagues have shown that the struc- 
tures of these O-glycosidally linked 
oligosaccharides are identical to those 
of glycophorin A, a major sialo- 
glycoprotein of the human erythrocyte 
membrane*. El and one type of 
glycophorin A possess the same amino 
terminal tetrapeptide sequence Ser-Ser— 
Thr—Thr. This region of the glycophorin 
molecule with its associated oligosac- 
charides constitutes the blood group M 


Table I. Structure and function of coronavirus MHV-AS9 glycoproteins?* 


Spike glycoprotein (E2) 


Membrane glycoprotein (E1) 


Mol. wt 0.18 x 106 0.023 x 106 
Cleavage fragments 90A, 90B _ 
% in external domain >90% <15% 
mRNA no. (mol. wt) 3 (2.6 x 10%) 6 (0.9 x 10°) 
No. of epitopes 4 (MHV-JHM) 2 (MHV-JHM) 
Processing N-glycosylation, cleavage, acyla- O-glycosylation 
tion 
Intracellular transport RER to Golgi to plasma mem- RER to Golgi 
brane 
Functions Cell attachment Determines budding site 
Cell fusion Forms viral envelope 


pH-dependent thermolability 


Neutralization (complement- 


independent) 


Interacts with nucleocapsid and 
E2 

Neutralization (complement 
dependent) 


Cell-mediated cytotoxicity 
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determinant, and purified E1 also pos- 
sesses blood group M activity. 

Intracellular transport of E1 differs 
from that of E2. E1 is transported to the 
Golgi apparatus where it is glycosylated 
and accumulates in thiamine pyrophos- 
phatase-positive cisternae (Ref. 25, and 
E. Dollar and K. Holmes, unpublished 
results). Unlike E2, E1 is not transported 
to the plasma membrane. The limited 
intracellular migration of El appears to 
determine the intracellular budding site of 
coronaviruses. Virions mature by bud- 
ding in RER and Golgi membranes, pre- 
sumably because this is the location of the 
E1 glycoprotein. Pulse-chase _ studies 
show that early in infection the synthesis 
of E1 is balanced with its release in vir- 
ions. El is the only viral glycoprotein 
required for virus budding, since virions 
containing only nucleocapsid (N) and E1 
proteins can be assembled and released in 
tunicamycin-treated cells”!. 

As a consequence of intracellular bud- 
ding, complete virions may be formed 
within the cell prior to insertion of large 
amounts of E2 glycoprotein into the 
plasma membrane. This would mean that 
enveloped virions are completed before 
the infected cell becomes susceptible to 
attack by humoral or cell-mediated 
immune responses directed against E2 on 
the cell surface. Clearly intracellular bud- 
ding may provide an important mecha- 
nism for persistent infection. Indeed, 
coronaviruses cause persistent infections 
both in vitro and in vivo. 

To analyse the signals responsible for 
the intracellular transport of El to the 
Golgi apparatus, cells have been microin- 
jected with purified mRNA 6 or trans- 
fected with the cloned E1 gene and the 
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distribution of E1 in these cells has been 
characterized by indirect immuno- 
fluorescence microscopy. Preliminary 
results indicate that at this level of resolu- 
tion the distribution of E1 is identical to 
that of E1 in virus-infected cells. It is clear 
that the intracellular transport signal for 
E1 is not determined by the O-linked 
glycosylation of E1, since the El glyco- 
protein of the avian coronavirus IBV, 
which contains N-glycosidically linked 
oligomannosidic _ side-chains, also 
accumulates in the Golgi apparatus’. 
Comparison of the predicted amino acid 
sequences of MHV-AS59 E1 and the IBV 
glycoprotein reveals that the sequences in 
the membrane-spanning region are highly 
conserved. Although their cytoplasmic 
domains are much less alike, both are rich 
in basic residues?’. It is likely that studies 
with hybrid E1 proteins or site-directed 
mutagenesis of the cloned El gene will 
lead to identification of the signals on the 
E1 protein which determine its route of 
intracellular transport. 

The chemical and physical properties 
of El are unusual, as might be expected 
from such a unique molecular structure. 
E1 is a highly hydrophobic glycoprotein 
which aggregates on heating at 100°C in 
SDS. E1 aggregates and binds to isolated 
nucleocapsids at 37°C. Aggregation of El 
molecules with each other in the plane of 
the lipid bilayer may be important in vir- 
ion formation at the budding site. This 
lateral self-aggregation may be similar to 
that of the nonglycosylated M proteins of 
paramyxoviruses which has recently been 
reported’’, El may form long tubules, 50 


_Ihm in diameter, which are found in the 


lumen of the RER in cells late in infection 
with MHV, or in cells infected in the 
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presence of tunicamycin’. 

No details of the interaction between 
the El and E2 glycoproteins have yet 
been elucidated. We surmise, however, 
that at a domain in the membrane where 
El molecules have inserted and aggre- 
gated, the E2 glycoprotein on the 
lumenal side of the RER or Golgi mem- 
branes may assemble into spikes perpen- 
dicular to the lipid bilayer. The helical 
nucleocapsid of the coronavirus probably 
binds to the cytoplasmic domain of the El 
molecule, anchoring the flexible helix to 
the forming viral envelope. This interac- 
tion may stabilize the helix in a tight coil 
which inhibits transcription or translation 
of the plus-stranded genomic RNA. The 
nucleocapsids seen in the cytoplasm of 
infected cells appear to be much more 
flexible and possibly more loosely coiled 
than the nucleocapsids in the virion. Thus 
E1 might play a role in inhibition of tran- 
scription and in stabilization of the virion- 
associated nucleocapsid. El may also 
function in uncoating. Following fusion of 
the viral envelope with cell membranes, 
the binding between El and viral 
nucleocapsid may be released so that the 
nucleocapsid can be used in the 
cytoplasm for translation, and then for 
transcription. 


Conclusion 

The glycoproteins of coronaviruses are 
significant both for understanding of viral 
functions and as models for glycoproteins 
with different patterns of membrane 
insertion, glycosylation, and intracellular 
transport. The E2 glycoprotein most 
closely resembles the spike glycoproteins 
of enveloped viruses with negative- 
stranded RNA genomes in structure, 
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function, and requirement for protease 
activation. The E1 glycoprotein may rep- 
resent a new class of glycoproteins resem- 
bling in some ways cellular glycoproteins 
associated with the plasma membrane, 
and in other ways, cellular glycoproteins 
associated with the Golgi apparatus. 
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a,-Antitrypsin and the serpins: 
variation and countervariation 


Robin Carrell and James Travis 


a,-Antitrypsin is a plasma protein which protects elastic tissue from proteolytic 
attack. Consequently, genetic deficiency, or the oxidation of its reactive centre in 
cigarette smokers can result in the degenerative ‘lung disease emphysema. Structural 
studies explain the mechanisms involved and have also drawn attention to a new 
family of serine proteinase inhibitors. The specificity of each of these inhibitors is 
primarily dependent on a single amino acid at its reactive centre. Site-directed 
mutagenesis is enabling the production of specifically designed inhibitors for 
therapeutic use, including an improved replacement for a,-antitrypsin deficiency. 


Human plasma contains several inhibi- 
tors of proteolytic enzymes that together 
form some 10% of its protein content!. 
A major example is the broad-spectrum 
inhibitor, c2-macroglobulin, previously 
described in T/BS (James, 1980). How- 
ever, the best studied of the inhibitors, 
and the one present in greatest concen- 
tration in plasma, is the more specific- 
ally targeted «,-antitrypsin (also called 
a \-proteinase inhibitor). Although this 
will inhibit most of the serine protein- 
ases, it is really an anti-elastase and its 
prime physiological task is the inhibition 
of elastase released by neutrophil 
leucocytes. 

The function of a,-antitrypsin is 
known because its genetic deficiency’ 
leads to a premature breakdown of con- 
nective tissue, to give a loss of elasticity 
in the lungs — a condition known as 
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emphysema. Extensive studies on the 
molecular pathology of this genetic 
deficiency culminated recently in the 
completion of the structure?* of the 
normal and variant molecules. 

An unexpected bonus from these 
structural studies is that o,-antitrypsin 
proved to be the archetype of a new 
superfamily> of homologous proteins. 
We have called this family, for con- 
venience, the serpins since it is primarily 
a group of Serine Proteinase /nhibitors. 
It is becoming clear that the lessons 
learnt from q,-antitrypsin are also 
applicable to the other members of the 
serpin family. One important con- 
clusion, already apparent, is that the 
inhibitory specificity of the serpins is in 
each case primarily defined by a single 
amino acid at the reactive centre of the 
molecule. This is strongly supported by 
previous work® on the unrelated plant 
proteinase inhibitors (Table I). An 
exciting corollary is the ability to re- 
design the specificity of inhibition by a 
single substitution at the reactive centre. 
This ability to design inhibitors for a 
specific purpose is already being de- 
veloped, for therapeutic purposes, using 
simple recombinant-DNA modifications 
of «,-antitrypsin’®. 


